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Review 

Human aldehyde dehydrogenase gene family 

AUin YOSHIDA. RZHETSKY-. Lily C. HSU- and Cb«e CHAHO' 

(R^ved 3 Wy/17Novcmb« 1997) - EJB 97 0937/0 

/•AT nm Beoea have been identified in huniflns. Theso gwies. located 
twelve aldehyde <><*y'to'S^t ^j^^S^f Soxidizes varieties ofaUphatic end erom^c 
on different chromosOTties. encode a gronp ^^^^^^i^^j wiih mirtetions of ALDHl, ALDM2. 

conserved in the family '«««*ers. sequences of humans, animals, fimgi. protozoa w»d 

n>e phylogenic tree constructed of A^^i^"™^'^^^ j^^ed from foui ancestral genes that 
eubacieria. suggests that *e P'-*?';^^ ^"^SSS.^ ndghbor-jolning tree derived fiom 12 
existed prior to ihe ^'Vf g«noe Eub«^ a^ gene closti* occurred 

human ALDHs and anUquitm indicates DupHcation in the ALDH 31101718 gene 

during the Ncoproterozoic period C*°f J^^^^S y^^^ Scpaxadons of ALDH3IALDHI0 

clittt« occurred in PhraciOMic penod ^f'?^^\^'^^^Jt^^ radiation of mammalian 
Ind that otALDiniALDHS bad occuned dunng the period oi eppranm 



species. 

Keyy^onts: gene family; 6»t»niic organization; 
tion; phylogenctie tree. 



genedc disease; genetic variant; dctoxiRcaiion; evolu- 



lindem viituaHy irreversible reaction. ALDHs m wioejy ois^ 

!5,?^N/^ far two distantiy related ALDHs. i.e. succumc 

Ti Most not all, corrcspoadrng members of the AU^n laimiy 

^tcDnL for tnore to 50 aniratils, fune,. and bactcnal ALDHs 
have been rcporced. 

^ « A YMhida Dept. of Biochemical Ocnctics, 

Bectanan Research Instiixiie of the Cixy of Hope Duarui, CJ^ 
^^\u^-.finy^ PlUYL aldehyde debydroecnase; SSDH, succimo 

nucleotide. „_rf.,t.j;- en? ««vi«iw /PM which wfll 

ATofs. This Review wiU M reprinted m a«vwivs 

be available ii> April 1999. 



This paper reviews the functional and struerural diversity and 
evolution ofthe human ALDH gene family. 

S is no uiwform nomedclamre system for human and 
^^AWH genes and enzymes. Therefor^ for 
deviated hu^n gene symbols (OBD ^y^bols) a« u^ fo 
genes (in italic) and enzymes Cm non-itJiUc) m the present re 
view. GcnBank identifieation numbers are also given. 

Members of ALDH femUlcs 

Twelve known human ALDH genes and 
^ymes are listed in T^ble 1. These genes consist of lO-^^cod' 
and span 11-40 kbp in various chromosomes. 
2^dS does not contain inirons in the coding region. Oenomic 

Proteins (enzyme subunits) encoded by these f « ""ff 
of *om 500 amino acid residues. Catalytically acove forms of 
trS^ arc homodlmers (ALDH3, ApH4) homotetra- 
mL^Dm,ALDH2.AlDH9,MMSDD) or unknown. 

%Sue dstribution and subcellular Iccalizatien of indivKiuel 
A'LDHs are also shown in Tabic 1. . * j - 

More r^cenSy. an additional human cDNA, tenmnvely dw.^ 
JJ es AlS CDNA, was cloned This «I>NA encoing 499 
^ acid residues, is 

duced amino acid sequence is highly similar (72 /o) to miman 
^DHl (Hsu et al. unpublished observation). Since the se- 
and ?JopUes of AIIJHll have not yet been pubhshed. 

it is not included in Tabic J . 
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Gene 
(GDB 
symbol) 



AlDHJ 
ALDH2 
AWHS 



(3l)brcviawd 

aLDHI 
ALDH2 
ALDH3 



Tissue . 
(subcellular disnibuhon) 

liver, stomach 

liver, saymath etc, (nut) 



Major 
subsuaic 



Codmg Subunii O^o- 
sMueace size so^*^ 



^PK4 A3.DH4 liver, IddntyCtni^) 



ALDH5 

Atom 



ALDH5 
ALDHe 



^iPH7 ALDH7 



ALDH9 
ALDHIO 




ALDH8 
yABDH 

faldh 

SSDH 



testis, liver (mit) 
sdivary glend, stomaclu 
kidjicy (cyt) 
kiOncy, lung (micro) 

paTOtid (micro) 

liver, kidney, musde (cyt) 

liver, heart, muscle (micro) 

brftiii. liver, heart (mit) 

Sidney, liver, heart (mit) 



rcnnal 

ecctaldehyde 
^tty and aromatic 
aldehydes 
gltuaznaie 
7-semialdehyd« 
propionaldjchydo 
jjiphatic aldehyde, 
retinal 

aliphatie and aro- 
matic aldebydcs 
unknown 
amine aldehyde 
itpy and aromauc 
&ldeh>'dcs 

5cmialdehyd<^ 

tncthylmalonatc 

scimiddehydc 



501 
517 
453 

563 

517 
512 

468 

451 
495 
435 



535- 



500 
452 

539? 

500 
511 

467 

450 
4^ 
484 

4&S 

503- 



9q2l 
12^^ 
17pll.2 



9pl3 
I5q26 



[8] (J04748) 
[9] (M20444) 
[10. U] (M7742^ 

[12](U24266) - 

[13] (M63967) 
[14] CU07919) 



11(113 U5. 161 COlO«^ 



llql3 
1(^22-24 
17pll 



[16] (U37519) 
tlT] CU34252) 
[IS, 19] (U46«J> 

[6, 20] (L34«2Q 

[71 (M93405) 



l^cedfrornibehaoiolagausiBt cD>^ A and enzyme. 



rinnitin cDT^ A (GcuBank S74728) [211 is distonily 
Human witiquittn ^f^x^^^^kp AlX>Hs, Human anUqui- 
relat^d (similarity 15--25%) to ™^ ALDH-like protein 

sxri";/^^^^^^^^^ J ^^y- -^^^^ - """^^ 

antiquirin has ALDH activity. 
Functional diversity 

hydes. FCT ««>=J- ALDIfc. NAD a ^ ^^^^^ 
ALDHs have em&rg ed 

utcd in various ^i^f*" *»^^^oSrof teS aU-frn^- «id 
enryme to ^^$,^3' ^ ^^.r.^\ at pH 7.S) [25] 
9-cis-rennal (*^<'' \jrl,7\r:i^ ftimation of lettooic acid 

wUch is apot«t mo*^Y to ^^c*^^^ ,^ ^ 

fcicntiatiMi-^W rolm reeepto>negative testicular 

patients associated ^jL'^ff^^^^ed in oto tissues [26. 
femlnizfrfion. aldioughtb* S«fe >f *'^]; n#i gene is mediated 
by androgen receptor md generanon otrewg' 
i/requiredfortest.«tod.^^^^ 
.'^''^^'m^So Sa«Wthe regulation of ge^ 



j-^^ mniiwi and Tflt ALDHs, v'hich are In^ly 

phamidcanapiaysaroic w u was denionstraied 

Leer drug., o-W«M>™ 1^ ^^^^d ^ the n^- 
Aat the acquired in the cells [35]. 



^ '^ALDHl is a major soluble constituent of eye lei.8 audniay 
play^Sfe in dS«.tion of pewxidic aldehydes pioducedby 
ultraviolet light absorption [37]. 

ALDH2 ' ^ ^latre a maior rOle in acetaldciiyd& 
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m^y nun ALDH2 aotrnty, AIDH2 is probably not essential 

for survival 

AtDHS ALDH3 oxidizes aromatic aldehydes and mediuin- 

^ owte^l (or m^etectable) in the uotmal liver- ALDH3 jS 
«™»sed in about 70% of poorly drffeieotiaied tmd 

[40]! Etiological rtdationships of hepatoma ALDH3 expres- 

"^t^rA^S .xpx.ss.on ^ observed 
induced rtt hepaton,n. andtnmjient ^Pj^^^'^^^^eX 
locytes treated with aromatic hydxocaibon xcnobiotcs (levjew 

" ^TS'oxazaphosphorine insistence was substantially d^g 
in temsto ceU Itoes tiensfected with rat or h«man ALDK 

S>"s.nicts [421. It was '^^^^^^^^'.^f^^X 
nhorine iwisiflnce was ftcconvamcd with 
C of /SLDH3-lik6 cnzytne in human breast and paro«l gland 

'^tnraSrt^S aldophOSJ^midc with pun^d 

.tnJlch ALDHJ and the ALDH3-likc enzyme pr<?.fl*ed fiom 

^^^stiJ Ttoo weak to aeeomt for fee resistt^ce 

52-45] mfs. it ^vas suggested that ^^O^^^^^'^LT^^ 
acuity of ALDH3 could be «ore severely d„,am^ 

benzaldehydc oxidation activity dunng P™fi<»ti»«- 

be extremely sensitive to ^'f'S 
wWdiwould lilcely itaet with many nuckophles and be ehim- 

^oelSV^^^'duced thO -^cripts mM^^^^ 
Ma con I^sfeeted with single cloned hnman M^ffi eDNA 
flrtiduces multiple ALDH3 componcnK. Thus, ALD»3 mum 
rMTbe due to post-iranslatlonal p^rtein mod.ficat«>a 

A variant -4IDi/3 (vf f ^^^^ 

sbout 0.25. C-G transveision at nt 985. Pro-Ala a* P"«jn 
po^ on 329) exists in bo4t Caucasi^s Ori^l^ 
Awording to the recent X-iay crystallographic jW^yof «t 
Pro329 exists at the end of short /^9. and a Pro-Ala 
CSi^^tio" eould cause struetoml and fimChonal changes [48] 
HSve!°noXi0J0fiical problems were n^nedm lie vanant 

""^m existence of ALDH3 r»ria«.t(s), dbtinprished ^cle^ 
trophon^is Of isoelectriefocusing, was ^f,^^'^,^.'"^ 
«nd Orientals [46, 49]. Hie mutation site(s) of the vanantQs) Has 
ro?2erS«^eVand it is not clcarj^ - "^^J 
ant(s) is identical to the common variant described abova 

XLDH3 is amajor eoustiiuent of comeam bum»B andhother 
animals [37. 50] and,like ALDHl. it may plsy arolem det«c,n- 
ctttion of peioxidic aldehydcs- 

ArDH4. ALDH4 is a mitochondrial enzyme with a Wfih 
octiv^for oxidation of y-5«lialdeh><tes sucb as i^^^ ^ 
SdeMe (a hydration p«)dnct of pynoline ^^f^^f^ 
Genetic deficiency of ALDH4 dbtmbs piolme dcgi*- 
Son Ld 4-Abu formation, causing type D ^m>^^'^ 
Sated with elevation of plasma protae evel, mental re^ 
S and eonvdsion [32. 53]. Tte mtnaUon st» of the d^^ 
fecUve £ene(s) has not been dctciroined. 



Hum 
Slab . 

AISN? 



190 op 



FIT 1 Cowjuiiiien or genomie organtofion of eJ^i «>"i»an 
«n« ST^tainiil the iranslaicd regions (shadowed) aic shown. 
Si do^nrZ^iiintmns in ia. ceding exon; the genonne organ,- 
luion of other tarnim AIDH ffa" is unknown. 



arnm rvABDH). vABDH is a cytosolic 6n25rmfi wifti a 
hifiifS^ f^<«i^tion of 4-aminobutyTaldahyde [54] The 
Si^S S expressed at a low level in aduh bram, but rt is 
™hr et5)ressed in the early stage of embryonic biam (gesta- 
agl of <12 weeks) [17, 54]. 4-Ab« may be pro*^ed 
puLeine raiher than from gtatamate, by fj''^™/'*^ 
and yABDH in the mammalian etnbryoiuc brain, as has been 
observed in the avian emtayonic taa«.[55]; 

The ALDH9 looos is polymorphic, i.e. C or T at nt 327. ana 
G rC^r34?.^e ^sTVommon (>50»/. in 
Ltojpe is C at nt 327 and G at nt 344. «id haplotypos of 
Sg344 and C327/C344 arc less common (about 20% each) 
Jfv] -nie transition C-^T at nt 327 is sUent. but ^^^^^"i^" 
G^C at nt 344 should induce the amino acid sabsumuonCys 
at protein position llS and eould alter «iZ3™e prop^«. 
The notion for the existence of "^^^-^P^^f '^^'s^ 
vABDHs (Uvcr enzyme with Ser, bram enzyme witii CyS) [56] 
is untenable. The difference is due to the existence of polymor- 
phism at tiie single ALDHQ lOCTS- 

ALDmO (FALDH). FALDH is a micit>6omal enzyme widi 
a high activity for oxidation of medjum-chain f f*=^ " 
hyd« (fetiy aldehydes). The gcnehc deficien<qr of FALDH dis- 
^ the mctaboUsm of membrane lipid causmg ^l*^"?;^*" 
sOTsvndtcme. an mherited disorder cbaractetized by ichthyosis, 
nSS^l ^blems and oligophr*ma [57). Five di^i- 
types Of mutations wete found in the patients exammcd [4^57]^ 
^described above. ALDH3 also has ahigh activity for oxida- 
tion of fatty aldehydes. Althotigh organiZBtion and snuchne of 
ALOm indALDffiO are higWy similar, ALDWOb^s a longer 
exon 9 and encodes the COOH-terminal transmembrane domam 
Wttci!!J rSomal enzyme, while laefcs the cone- 

Ljnding sequence in exon 9, producing a cytosolic cn^ 
^ 1). TT^ ALDH3 probably cannot supplement the inle Of 
FALDH in die syntiiesis of membrane Hpid, 

SSPH. SSDH, whi<* is strongly expressed in brai^, has a 
hieh activity for oxidation of succinic semmldehyde [20]. Oe- 
nrtic dcficiLo- of SSDH disturbs the oxidation of sueemic 
semialdehyde which is produced ftom 4-Abii by 4-Abu-Qhi 
^^ninise. causing accuniulation of sucmic sennaMehyde 
and psychomotor leiardation [58]- The mutation site of the gene 

has not been deteimmed- ^ 

In addition to SSDH, adalt bram contams ALD«1. ALDH2 
and AIDH5. yABDH is also expressed at a low level m adult 
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ALDH1.PRO 
ALDH6.PRO 
ALDHZPRO 

ALDHfl.PRO 
SSDRPRO 
MMSDH.PRO 
AL0H3.PRO 
ALDH10.PRO 
ALDH7.PRO 
ALDH8.PRO 
ALDm.PRO 



f^^.,;^ jJl human ALDH family members CFig. 2). . 

ALDHs. Fun^Oj^Uo^ J^^^'d^^g^U and *ray 

have been tdenmiea rtii if marked is im- 

amino acid tesiflue oio !^Vt., uvrf wossIy reduced Ihe ca* 

of bovine At.DH2 ™Worts this noU<>nt631. 

whid. foims a X loss of a catalytic 

^J^g^ESirSXe^-aT^^^ of bovine ALOm [63] 



.r IMT1 iT nm and AI.DH3 can oxMtae dopaldehi«ie 
brwo. ALDHl, ALDH2 Al" . „oi but Iheir roles in do- 
(3 A-dihydroxyphenyl acctaldehyde) \59U out 

p^ine l'Mn^*!»^,'^S"could be attributed to ALDH5 

^ '^S', for oxidation of short- 

gene product, exhibited a tagh activity 
chain aliphatic aldehydes [60J. aehydrogenase in the 

MMSDH is the oiJy CoA-'lepeiKl«atae y 6^ ^^^^ 

ALDH family, "^.f^ '^^flT'^^Z^\Z>i^y^ 
Srcr^rSS&^^o- S ly^^^^volvcd in trte 

ALDH 6 nas acnvuy i^jyoWcd in retinoic acid 
Icnown. 

Structural similarity 

ftf the amino acid seqaenccs of huinan 

A coiupanson of the * ^tnone the mem- 

ALDH5 indicates a wide .^^^"^^^^ |aps, ranges 

of the other members. ^ t^^^^on nrcamzatioiis revealed 

^i;'' rC^^r3To.1JS^«^ Motion position 
K^'^S^^incachexoncoin^dewithineach 

gro^ (Fig- 

to ALDHl, ALDH6. ■^^^ ,^^'J2^> 1 Md 6.N &r 2, Shown rtcve ""^^^ fa Vh.ded boxss. Kotc ftat only 

^trh^iirw-eT^^^^ 
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POfllSTALDH- BjrBaxHa t^fifism 



-p wft i AtDHiyaast 



,P4O0<rAU3H Cput>yaasr 
.523223 AU3H a aftgWtt- 



100 J_P05D91 ALDH2iTUlTiBn 

-P^B37'ALDH5 human 
.,5444a2JUi3H6 Mttflfln ' 
-P27iie3 AU3H1 cWctert 

971 j ga^ P24Si^ AL0H1 ftiousa 

^oSl n P13S01 ALDH1 rat 

" _40a4S3AU3H1.ahiaw 
,P0(nS^L£3H1 Wimait 
„ pi543rALDH1 hWW 




a^Q^2S£5AM&DH human 
L-aa7536 MMSDH bovina 




P4S2S6 aLDH a AJOfSb 

P33CKia AUDM Psflt/aomofias w- 
y2S32AA ALDH*-tTumar 



.P2SS3ALDHA£.«t5f 

.PAfiES^AUOH-Cpuc) (Ta/ffgafTS- 

.P12B93 aLDH a oloavoisncs 

.paOB4a AUJHI £./lBStaBi(33 



P433SX ALDH7 human 
.ALOHanuman 
.ALDH3 Human AEDK/^/S/lO 
Pliees ALDH3 rar 
^LDMIO humarr 
P30339 ALDHIOrar 



.paaasTAUDH (put) yroar 



SSDH 



AEDH9 



■PZ3SB3 ALDH (pUL) ^tiOff 
P23Z40 AU3H VfOsiOvMstm- 



group is nol included m the ace. 
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Fig. 4. A cCighDaHoininB twe derived from j^.^^-^j^" f^^^J^J^ 

ftomVa dialysis. Pootetrtip valuca above 50% arc ^^d^cawd. The hm« 
o^S^cnce'wer. estimated by linearized '^SojitKm " 
on ihc known ALDH pnMein sequences Cfom rodents, P""^^^^ 
orcyoda^tyls, *T.d on the divergence time 104 million ycaiB (MyO 
for RodMHia/Artiodnayln bifurcadon [77]. 

«Cid position 236 in Fig. 2) was also i«^P^^<^^^^*^ j?;?^^,^^^^^ 
by a site-dircctcd mutagenesis smdy of human [66, 67]. 



eubocteria. The age of the old^^t ktjown extinct eukaiyotes. Cry 
noTiifl irzpirato, was estimated to be about 2 billion agOp 
Vhcreos the oldest traces of ancient Ufe on the earth ait believed 
to be approximately 3.6 bilUon years old [71]. 

A standard rtathemaHeal analysis of the recanatrueted ph^do- 
ccnic tree derived fcom the intron-exon oiganization of me nur 
^ ALDff genes strongly suggests that ^^^^enes evo ved 
from an ancestral intronless geaic by mtron mscrtion. rattier than 
by a scries of intion deletions of an anoestrvil geae with mtions 

very cwly appearance of ALDH is fiot surpriaing. sinee & 
non-speeific AIDH witJ> a broad substrate specificity was csstn- 
Hfll to protect early life on the ancient earth which was nch m 
varieties of highly reactive and toxic ^^^^^y^^.^,^f^J^^ 
of the present day human ALDH i502ymes, wbch display di- 
verse tissue Wtd subceflular distributicms, is stll detoxification 
of biogenic and xenobiotic aldehydes existing m given tissues. 

AU)H5 with more specialized activities had evolved from 
the ancestral ALDHl, A1;DH6 and the .r^^^ently ^den- 

tified ALDHU which have high activities for oxidation ofreu- 
nal, had evolved together before the radiation of multcelluiar 

orga^S^ and ALDHIO. which may be involved irt the metab- 
olism of membrane hpids, had evolved tdgcthcr during the pe- 
riod of appearance of mammals. Meanwhile, two distantly re- 
lated enzymes, ijt, A1DH4 which participates in 4-Abu synthe. 
sis. and SSDH which participates in 4-Abv degradation, were 
separated much earlier. atxw^ 
We apologize to many investigators whose work on ALDHs 
were not cited because of Space limitations and because theu- 
work is beyond the scope of this review. 



Evolution Of AlDH 

A general neighbor-joining tree was derived firtjm 56 ALDH 
sequences of humans, animals, fungi, protozoa a^id bactcna 
(Fig 3). A human neighbor^joinmg ttee was derived from 12 
human ALDHs and antiquiiin (Fig. 4). 

There is inconsistency in the branch point of human SSDH 
and MMSDH between the rwo neighbor-joinmg 5«B^e. they 
^ c^ercd in the human tree (Fi^4) wh^e th^ h^o^ to 
separate clusters in the general tree (Fig. 3). Since boo^irap P- 
vSaes of these clusters are low (<20%)in both trees,^ on^ 
of these rwo genes is not certain. The branch pomt(s) of ALDH4 
and amiauitin is also not certain. „ ^ _^ 

fe^ for the uncertainties described obove, aU cluster 
branch eoints have very high bootstrap P-vahies, iiidic>«ftg a 
K4« of validity in the tn:cs. The trees tadicate that&v^- 
"iation wiihto the ALDH 1W5/6 cluster was hke^ to oeottr 
much earUer than within the ALDH 3/^0/7/8 dt.3ter^.gs 3 «d 
4). Tn the icconswiction, diversiflcation W>thm ALDH 1/2^5/6 
cUer happened during the Neoptoiozoic ptrtod ((893 nullton 
y6«s ago), long before the Ediaoatan rtdianon of tnettzoan 
\m According to the cuiitsnt paleontological paradigm, the 
eai4 durine this period was dominated by unicellular orgamsms. 
and mulU<^llul(^ organisms, mostly algae, were sc««e and 

"^lif ntultiplicaticn in the ALDH 3n(V7/8 cl>«ter 

scared to arise much later (-287 mlUion ye^ ago) mlhe I^- 
neiozoic period, when dWeise vertebrate and invertebnue a^ 
mals were abundant The latest two dupUcanons ALDH3/ 
ALDHIO and ALDH7/ALDH8 took place about 178 and73 nnl- 
Uonyeara ago. i.e. cotre8po«dkg to the periods of early appear- 
ance and ladialion of mannnalian spsciw- 

The separation of two major groups (i.e. ALDHt/2/S/6/9 and 
ALDH3/10/7/8) had occurred more than two bilhon years ago 
^^i; before the separation of the ancestors of evkaiyotes and 



Studiee hi this labotatoiy jupported by ihe US Pubhe He»ltt> 
Service (gnat HL-2?515 to A. Yoahida, QM20293 to M. N«) snd by 
the Nation*! Science Foundation (gtmit DEB 9520832 10 M. Nei). 
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